CONVERGENCE OF THE CASCADE ALGORITHM AT 
IRREGULAR SCALING FUNCTIONS 

OLA BRATTELI AND PALLE E. T. JORGENSEN 



Abstract. The spectral properties of the Ruelle transfer operator which arises 
from a given polynomial wavelet filter are related to the convergence question 
for the cascade algorithm for approximation of the corresponding wavelet scal- 
ing function. 
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1. Introduction 

Two operators from wavelet theory are studied: the refinement operator (alias 
the cascade approximation operator) M (see (2.9)), and the transfer operator R 
(see (2.25)). In the case of a compactly supported scaling function tp, we then 
consider the approximation problem 

(1.1) limM"V (0) =^ inL 2 (R), 

n — >oo 

where V'*- '' & L 2 (R) is given. A result is proved which relates the spectrum of R 
to the question of when ip = linin^oo M n i)^°\ For those vectors ip^ where the 
approximation holds, the rapidity of the approximation is related to the spectral 
data for R. 
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2. General Theory 

It is well known that compactly supported scaling functions ip of a multiresolution 
analysis satisfy the functional equation 

JV 

(2.1) ip(x) = V2^2a k ip(2x - k) 

(at least after an integer translation of tp) [Dau92, CoRy95]. The standard re- 
quirement that {ip ( • — k) | k G Z} forms an orthonormal set of functions in L 2 (R) 
implies the conditions 

(2.2) ^ a k a k+2 i = Si 

fcez 

for all / G Z, and the second standard requirement that (p (0) = 1 translates into 
the condition 



(2.3) ^> fc = V2. 



If 



(2.4) m (*) = E 



for z = e lt e T, condition (2.2) is equivalent to 

(2.5) |m (z)| 2 + |m (-z)| 2 = 2, 
and (2.3) is equivalent to 

(2.6) m (l) = \/2. 

However, in general, orthogonality of {iy9 (x — k) \ k G Z} is a condition which is 
more restrictive than either one of the two equivalent conditions (2.2) or (2.5). The 
Fourier transform of (2.1) is 

(2.7) ^=^°(£K£)- 

We apologize to engineers for using t to denote frequency. Since ip has compact 
support (and then (2.1) implies that the support is in [0,7V]), ip is continuous at 
and an iteration of (2.7) gives 

Since mo is a polynomial, this expansion converges uniformly on compacts. 

Now, let ip(°*> be any bounded function of compact support such that ip( 1 (0) = 1, 
and define by iteration 

(2.9) (x) = (mV> ( ™ } ) (x) 

N 

= y/2^2a k i>^(2x-k). 



k=0 
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Then 



(2.10) ^+1) (t) = _L mo (0 ^(n) ^ 



and hence 



(2.ii) («) = n f m ° ( *r fc) > ) («2-) 



fc=l 



V2 ; 



As a result, i/^™* 1 — > uniformly on compacts, and thus ip^ — ► </? in the 

n^oo n^oo 

distribution sense. In short, if {ak \ k = 0, . . . , iV} is a finite set of coefficients 
satisfying (2.3) alone, the refinement equation (2.1) possesses a distribution solution 
tp with tp (0) = 1 and compact support in [0, N]. This solution is defined by (2.8), 
and is given as the distribution limit 

(2.12) tp = lim M"V (0) , 

where is an Y integrable function of compact support such that tpW (0) = 1. 
When this is used to depict the graph of tp, it is common to take tp^ to be the 
Haar function 



(2.13) V (0) (x) 



1 if0<z<l, 
otherwise; 



see [Coh92]. In this paper, we discuss a variety of choices. The question of when 
the convergence in (2.12) is stronger than distribution convergence, and the related 
question of what regularity properties the limit tp has, have received much attention 
in the literature; see for example [Str96], [Vil94], [CoDa96], [Pol92], and references 
cited therein. In all these references, it is also assumed that the condition (2.2), 
alias (2.5), holds. If 

(2.14) «,^n(ip) 

is the n'th partial product in (2.8) one then shows, using (2.5), that 
(2.15) 

\pn(t)\ 2 dt= \Pn{t)f dt 

>7T JO 

I 71 7T 

= J \<p n -i (t)\ 2 \ (| mo (2-"t)| 2 + | mo (2~ n t + tt) | 2 ) 

/ l 2 71 7T p7T 

= \p n - x (t)\ 2 dt = ■ ■ ■ = \p (t)\ 2 dt = 2ir. 

JO J— IT 

Since the convergence tp n — ► tp is uniform on compacts, it follows that 



dt 



(2.i6) \m<27r. 

Hence tp e L 2 (R), and \\tp\\ 2 < 1. Since y>„ ( • ) X[-n,ir] ( ' 2~ n ) converges, as n — > oo, 
uniformly on compacts, and has constant L 2 -norm equal to \/27r by (2.15), it follows 
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that this sequence converges weakly to (p in L 2 (R). Consequently it converges in 
L 2 -norm to (p if and only if 

(2.17) 110112 = 2tt, 
i.e., if and only if 

(2.18) IMI 2 = 1- 

It is well known that this is equivalent to several other more practical conditions, 
like (2.19)-(2.23) below: 

(2.19) {ip ( ■ — k)} keZ is an orthonormal set in L 2 (R). 

(2.20) J^kez (t + 27rfc )| 2 = 1 fo r all t G R. 

(2.21) The only trigonometric polynomials £ satisfying 

are the constants. 

(2.22) There is no nontrivial cycle in 

{z G T | |m (z)| = V2} - {z G T | \m {-z)\ - 0} 

for the doubling map zhz 2 . 

See for example [Hor95, Theorem 3.3.6] or [CoRy95, Chapter 2] for details. We will 
especially need (2.21) in the sequel. By the previous remarks we also note that this 
is equivalent to: 

(2.23) The cascade algorithm, with 

^ (0) (*) - *[-*,*] (x) = ^ ^ e Ux dt=— sin („x) , 

converges in L 2 -norm to ip; 

and then {</?(• — k)} is an orthonormal set. 

In contrast to the distribution convergence, this latter convergence depends very 
sensitively on the choice of initial function ip(°\ If for example the starting 
vector above is replaced by sin (Nirx), then IjV'^l^ = N for all n, so 
cannot converge to tp in norm (although it does so weakly). The question is then 
which initial functions ip^ can be used. One approach, developed by Strang [Str96], 
establishes L 2 -convergence under general circumstances if is chosen such that 

(2.24) k 0) (-- fc )} feez 

is an orthonormal set. Thus the standard choice ip^ = X[o,i] i s included. To 
describe these "general circumstances" we introduce the Ruelle operator 

(2.25) m{z )= 1 -Y^ \mo{w)fZ{w). 

w 2 — z 

We may view R as an operator on any of the spaces 

(2.26) C [z, z- 1 ] C C (T) C L°° (T) C L 2 (T) . 

and in particular it is clear from (2.25), (2.4) that R maps any of these spaces 
into itself. Since R (C [z, ) C C [z, z _1 ] , the invariancc of C (T) follows from 
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the Stone- Weierstrass theorem. This also follows directly from (2.25), using the 
continuity of too. 

If P[n,m], n < to, is the subspace of C [z, z~ x ] consisting of trigonometric 
polynomials of the form Y^k= n bkZ k , we note that 

N — n\ ( m + N 



(2.27) R{P[n,m])cP 

where [x] is the largest integer < x. Thus any P [n, to] will ultimately be mapped 
into P [-N, N] by repeated applications of R, so all the spaces 

(2.28) P [-N, N]cC [z, z- 1 ] C C (T) C L°° (T) C L 2 (T) 

are invariant under R. 

Let us study the spectral properties and the norm of R on the various spaces. 
Note that £ (z) = 1 (the constant function 1) is an eigenvector of R with eigenvalue 
1 in all the subspaces; and, by (2.21), it is the unique eigenvector up to a scalar 
with eigenvalue 1 in C \z, if and only if \\ip\\ 2 = 1, i.e., if and only if the cascade 
(2.23) converges in L 2 -norm. In any case, (2.25) and (2.5) immediately imply that 

(2-29) Pllo™ - 1, 

and hence the spectral radius of R, as an operator on the four left-hand subspaces 
in (2.28), is 

(2.30) Pco (R) = 1. 

Let us note en passant that the behaviour of R as a operator on L 2 (T) is different. 
From (2.25), we get for the L 2 (T)-adjoint operator: 

(2.31) (R*Z)(z) = \m (z)\ 2 t(z 2 ), 
and hence 

(2-32) (RR*0 (z) = lYl K HI' ( R *0 H 

W 2 — z 

= ^J2 |to H| 4 £(z). 

W 2 — z 

Thus RR* is a multiplication operator, and it follows from (2.5) and (2.6) that 

(2.33) \\R\\ - \\RR*\\i = Q (V2) 4 ) ' = y/2, 

so R is not contractive, nor normal, on L 2 (T). 

What is the spectral radius? From (2.25) and (2.31), we verify the identities 

(2.34) (R n 0(z) = - ITh°K)| £ W 

w ^ =z k=0 

and 

n— 1 2 

(2.35) (R* n 0(z)=H\m (z 2k )\ £ (* 2 ") , 

fe=0 
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and hence 

1 11—1 4 

(2.36) {R n R *n 0{z) = _ J- nh°K)| 

holds. Thus R n R* n is the operator of multiplication by the function 
1 n ~ 1 4 

(2.37) - ]r nh°( w2fc )l =PnW. 

and it follows that the spectral radius of R is 
(2-38) P2 (R)= lim ||p n ||&. 

n-^oo 

Note that 

(2.39) Pn(z) = ^ \mo (w)] 4 Pn-i (w) 

w 2 —z 

and 

(2.40) po (z) = 1. 
But, by (2.5) and (2.6), we have 

(2.41) i<i £ KHI 4 <2, 

w 2 — z 

with both extremities attained. By (2.39) and (2.41), and induction, we further 
have 

(2.42) l<Pn(z)<2 n , 
and hence, by (2.38), 

(2.43) 1 < P 2 (R) < V2. 

After this diversion into Hilbert space, let us return to the cascade algorithm. 
Strang's criterion is based on the following basic lemma: If ipi , 1P2 are L 2 -functions 
of bounded support, define their relative polynomial by 

(2.44) p (Vi > 2 ) (z) - V z fc / Ji (x - k)4> 2 (x) dx. 

k 

Note that 

p(L 2 ([0 7 N}),L 2 ([0,N})) CP[-N,N], 

i.e., p maps L 2 ([0, N}) x L 2 ([0, N]) into a 2iV + l-dimensional space of trigonometric 
polynomials. 

Lemma 2.1. If M is the cascade transform defined by (2.9), then 

(2.45) p(Mip u Mip 2 ) = R(p(ip u ip 2 )) 

for any pair ip 2 of L 2 -functions of compact support, where R is the Ruelle 
operator (2.25) on C [z, z^ 1 ] . 
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Proof. If £ (z) — J2k Xkzk *= ^ [ z ' z _1 ] ' ^ s F° ur i er transform is (xk) keZ , and the 
Ruelle operator (2.25) transforms into 

(2.46) (** X ) k =J2J2^ aiX i- l + 2k - 

j i 

This follows from the next computation on 

(2.47) £(z) = 5> fe z fc . 

k 

We have 

(2.48) (i?0 (z) - i J] ^TH^o H £ H 

W 2 — Z jl k 

= ^EE d J a l x k w^+ l+k 

w 2 —z jlk 

= ^ EE a jai x^ l+n w n 

w 2 —z jln 

= ^E E w n ^2d j aix j -i +n . 



2 

n w 2 =z jl 



The terms with odd n disappear in the last sum, so 

(2.49) (RO (z) = E z k ^ajaiXj-i+ik, 

k jl 

and (2.46) follows. 

If ipi, ip2 are L 2 -functions with compact support, define 



(2.50) b k = / Vi (x - k)i> 2 (x) dx, 

JR 

(2.51) c fe = / (MV>i) (x-k) (M^ 2 ) (a;) dx. 
To prove (2.45), we have to show 

(2.52) c fc =(£&). 
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We have 
(2.53) 



Cfe 



(MV>i) (x - k) (Mfo) (x) dx 
= 2[^ a mM 2x-2k-m) a nM 2x-n)dx 

mn 

= [^-a m a n M x-2k-m) M x-n)dx 

J R mn 

= [j2-aman M X + n-m-2 k) M x)dx 

J R mn 

= ^2 ^ma n b m - n +2k = (^) ' 



This shows (2.52), and Lemma 2.1 is proved. 



□ 



We will list a few more preliminaries on the relative polynomials before we prove 
our theorem on spectrum and cascade approximation. As in Lemma 2.1, wc consider 
functions ip, ipi, ip2 in L 2 (K) of compact support. Usually the support of the 
functions in question will be assumed contained in a fixed interval [0, N] C M. 
Consider £ (z) = Y,k^z k G C C C(T), £ fe = f (fc) denoting the Fourier 

coefficients. Defining 

(2.54) (^*^)(x):=J2^(x-k), 

fcez 

we note that 

(2.55) (£*^n*)=£(e^HW, 

where ~ as usual refers to the R-Fourier transform. The sesquilincar operator 
Pi^ii^) in (2.44) will be viewed as a quadratic form 

(2.56) p: L 2 (R) c x L 2 (R) c — » C [z, z" 1 ] cC(T)c L 2 (T) , 

with the subscript c standing for compact support. 

Positivity of this C (T)-valued form p follows from the following identity: 

(2.57) pWsVO («) = J2 zk dx 



PER ( U 



e fc (i) PER 



(*)> 



(e u ) dt 



with the convention z = e lt , (t) = z k = e lkt , and 



PER(/) (e u ) =J2f(t + 2nn). 



Similarly we derive the formula 
(2.58) pty>i,^)(*)= PER 
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From this it is immediate that the relative polynomial p has the sesquilinearity 
property 



(2.59) 



P (V>i, i>i) (z) =p(ip2,ipi) 0), 



and that it is densely defined on L 2 (R) x L 2 (R). We next show that when ipi £ 
L 2 (R) c is given, then p (tpi, ■ ) extends boundedly to L 2 (R) in the second variable, 
and that it becomes a module mapping in that variable relative to C [z, z _1 ] C 
L°° (T). This is actually slightly more than what we need (see the proof of (ii) =>■ 
(i) in Theorem 2.5). On the other hand, one may use the Zak transform [Dau92] to 
prove a stronger continuity estimate: p may be extended to a sesquilinear function 
L 2 (R) x L 2 (R) -» L 1 (T) such that 



Wc turn to the details. 



Lemma 2.2. The relative polynomial p has the following properties: 
(a) Norm estimate: 



(2.60) \\p(^i,i>2)\\ L 2 m = ^J o |PER (&&)(*) 



dt 



< 



PER 



^1 



(b) Module property: 



(2.61) p tyut * V2) (z) = £ (z) p tyi , V2) (^) • 
(c) Ruelle operator covariance: 

(2.62) M mo (£ * V) - M moC (V) 



and t/iws 



(2.63) fl(p(^i,e*^)) (*) =P(M OTo ^i,M Cmo ^) 

w/iere (ran is ffce pointwise product. 
(d) Cascade smoothing: 

(2.64) (M™ (£ * V)H*) - £ (e~#) (M™i>) (t) , 
w/iere M = M mo is ffce cascade operator. 
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Proof. Ad (a): By Parseval's identity in L 2 (T), 



(2.65) 



1 c 

llp«'i,*2)lli, m = ^| \p»'i,M(e-")\ it 
fcez ^ R 



E 



^ / e k (i) Vi (t) 02 (t) dt 



1 /-27T 

5 s/. e ' (()PER (' 



2tt 



2tt 



/o 

PER (^ 2 ) 

2 



0i0 2 (t) dt 



< — / I PER 













2 


< 


PER 


■01 










2 




PER 












2 




PER 


■01 





-/ 



•PER 

2tt 



i'2 



PER 



02 



dt 



1 

' 2^ 

' / 10: 
./r 



02 



'2 (x)| 2 dx. 



,4d (b): We have 



(2.66) 



p(0i>£*02) (2) = ^z fc / 0i (a; - fc) (£* 2 ) (a;) da; 
feez ^ R 

= VV6^ / 0i Jx - fc)0 2 (x - da; 
= S 2 / 0i + ^ - *O02 (*) dx 

1. ; 



fe z 

C(Z)P(01,02) (2). 



Ad (c): Since 



(2.67) fl(p(^?*W)=P(^,MK*^)) 
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by Lemma 2.1, we need only calculate M (£ * ^2), where M is the cascade operator 
M = M mo given by the low-pass filter mo- 

(2.68) M (£ * ^2) (a;) = V2^ a k K * $0 (2a; - fc) 

fe 

= V2 ^ a fc ^ 6V2 (2a: - A - i) 
fe / 

= V2^2 ^2 ak£n-kip2 {2x - n) 

n fc 

= ( TO °0~ ( n ) ^2 (2a; - n) 
= M moi ip 2 (x) , 

where M mo £ is the cascade operator corresponding to the product filter (mo£) (2;) = 

™o (^K (z)- 

,4d (d): From the definition of the cascade operator M, we get (Mip)~(t) = 
-j^rn (|) ift (|). Now apply this to £ * V) anal use (£ * V'PM = £ ( e_l *) V' (*)■ 
Iteration yields 



* -0)' 



(2.69) (M n (£ * VOn*) - II ( Jf) (£ : 

=f( e "*)n^ m o(^)^ 

= ^(e~^) (AP^H*)- □ 

As a corollary, we note that 
Corollary 2.3. 

(2.70) p (L 2 (R) c x L 2 (R) c ) = C [z, z' 1 ] . 

Proof. We have already commented on one inclusion in (2.56), and the second 
follows from 

where i/' (0) e i 2 (R) c is chosen such that p (^ (0) , V' (0) ) = 1. 

(Equivalently, PER ^(°)^(°)^ = PER ^ ^ = 1.) □ 

Remark 2.4. The positive elements in C [z, z _1 ] (i.e., pointwise nonnegative as 
functions on T) arc of the form |£| 2 = by a theorem of Riesz [Dau92, Chapter 
6], [Akh65, p. 181], and we get |£| 2 = p (£ * V (0) , £ * ^ (0) )- 

We are now ready to prove the main theorem of convergence of the cascade 
algorithm, which is a version of Theorem 4 in [Str96] . 

Theorem 2.5. Let a , a\, . . . , ajv &e complex numbers satisfying (2.2)-(2.3) and let 
(p be the associated scaling function defined by (2.8). Identify the Ruelle operator R 
given in (2.25) with its restriction to P N] (or to any P [n, m] with n < —N, 
m> N). The following conditions are equivalent. 
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(i) R has 1 as a simple eigenvalue and |A| < 1 for all other eigenvalues A of R. 

(ii) Iftp^ e L 2 (R) is a function with compact support such that {^(°) ( • — fc)} fe 

is an orthonormal set and ip^ (0) = 1 i/ien 

p - M"V (0) 



(2.71) 



lim 

n— »oo 



o. 



Proof, (i) =^> (ii). As an aside, remark that i? having 1 as a simple eigenvalue means 
here that the corresponding cigcnspace is one-dimensional. But since ||i?|| = 1, and 
hence n i— > ||i?™||' is bounded in any equivalent norm || • ||' on the linear operators 
on P [-N, N] = C 2N+1 , it follows from Jordan's theorem that the multiplicity of 1 
in the characteristic polynomial is 1 too. 

Let us view P [-N, N] = ^J2k=-N x k zk } as the space of sequences x = (xk)^. 
Since 

(2.72) (i?0 (1) = \ (|m (1)| 2 i (1) + |m (-1)| 2 1 (-1)) 

for all £ e C [z, z^ 1 ] , we have 



N- 



(2.73) 



]T(i?x) (fc) = 5>(fc)- 



But as -R(<5o) = <5o, <5 is the unique eigenvector of R corresponding to eigenvalue 
1, and as the functional x ^ J2k x (^) ' s preserved by R, it follows from (i) that 



(2.74) 



for all finite sequences x. Thus, by Lemma 2.1, 
(2.75) p (</?, M"V (0) ) = P (M n V, M n iP^ 

= i?"(p(W°))) 



J <p{x-k)ipW (x) dx"j • a. 



But the two assumptions on ip^ imply the so-called Strang-Fix condition [Vil94, 
Str96] 

(2.76) J2^(x + k) = l 

k 

for all x, and hence by the above, 

(2.77) p(^,M"V (0) ) 
In particular this means that 

(2.78) lim ( ,:• 
But by (2.45), 
(2.79) 



<f (x) dx = <p (0) = 1. 

p(m"v(°),m"v (0) ) =fl n (p(V 0) ,^ (0) )) • 



1. 
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Since p ?/>(°)) = 1 by ortho-normality, we conclude that 



(2.80) 

and in particular 
(2.81) 

for all n. Also 

(2.82) 

so, finally, 

(2.83) 



(M n ip(°\M n i/jW 
M"V (0) 



R n t = 1, 



if - M"V (0) 



|M>||^ = |M|^ = 1, 



= \\<pf 2 -2Re^ M"V (0) )+ ^ (0) 
— > 1-2 + 1 = 0. 



(ii) => (i). We now assume cascade convergence in the sense (ii), i.e. 



(2.84) 



ip - M> (0) 







for the initial vectors tp which are specified in (ii). The object is to derive from 
this the spectral picture for R as specified in (i), and R will be identified with its 
restriction to P [-N, N] as mentioned. Of course P[-N,N] C C(T), and R is 
also, by (2.28), an operator mapping C (T) into itself. Its adjoint on the dual space 
of measures M (T) is given by (R*n) (C) = fJ- (R£) = J T (B4) (z) dfx (z), C e C (T). 
The Dirac point-measure 8\ £ M (T), given by Si (C) = C (1), is invariant by (2.5), 
i.e., R* (Si) = Si. 

Consider the eigenvalue problem: 

(2.85) i??o = A£ , AeC, Co eP[-iV,7V]\{0}. 

Then Si (Co) = <*i (i?Co) = A5i (Co), so 5i (Co) = Co (1) = if A ^ 1. We assume 
this, and since H-RH^^^ = 1, the discussion may be restricted to |A| = 1. We 
claim that, if A ^ 1, A e T, then Co = 0, so we cannot have nontrivial peripheral 
spectrum. 

By (ii), \\<p- M n V (0) || 2 -» 0, where i/> (0) is any initial vector with the stated 
conditions, e.g., ip^ — X[o,i]- Using Lemma 2.2(d), we also get 



(2.86) 



ip- M n (c* V (0) ) 



0. 



whenever £(1) = 1- For example, take C = 1 + cCo to have this satisfied. Then 
i?"C = 1 + cA"Co, i.e., a divergent sequence if A ^ 1 and c ^ 0, supposing Co 7^ 0. 
Using Lemma 2.2, we will show that 



(2.87) 



i?"C = p (m"v (0) , M" (c * V (0) ) ) 



where the last convergence is in the finite-dimensional subspace of C [z,;? -1 ] and 
thus in any norm. This will contradict the divergence of R n £. The formula (2.87) 
can be verified in two ways: since ip, M n (V> (0) ) and M n (C * ?A (0) ) all have sup- 
port inside a common compact set, the convergence is immediate from the fi- 
nite sum (2.44), (2.84), and (2.86). Alternatively one can use Lemma 2.2(a), 
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and PER V (0) 
M n (£ * V (0) ) 



1. In checking the conditions in Lemma 2.2(a), we note that 
£ (1) ip, in L 2 (R), so we must verify that 



PER 



( 



M"V (0) 



= R n l = 1. 



is bounded in n. But the function inside || • equals 

P (m"v (0) ,m"v (0) ) = R n (p (V 0) , V> (0) 

This contradiction completes the first part of the proof of (ii) =^> (i). 

It remains to show that (ii) of Theorem 2.5 implies that A = 1 has multiplicity one 
in the spectrum of R, where again R is identified with its restriction to P [-N, N]. 
Since Rt = 1, we need only exclude that the multiplicity is 2 or more. But 
||M n V (0) || 2 = 1 for all n by (2.45) and Rt = 1 and it follows from (2.71) that 
\\ip\\ 2 = 1. By (2.18) (2.21) it follows that A = 1 has multiplicity 1. □ 

3. Some examples 

Our interest in the subject of cascade approximation was ignited when using 
the cascade algorithm in [BEJ99] to draw the scaling function associated with the 
low-pass wavelet filter 

3 



(3.1) 
where 

(3.2) 



,(«) 



(*) = 5> 



k z 



k=0 



1 



2V2 
1 

1 

1 

2Vl 



(1 — cos0 + sin#) , 
(1 — cos 8 — sinO) , 
(1 + cos# — sinO) , 
(1 + cos 6< + sin 6<) ; 



and 9 varies over the circle; see [Pol89, Pol90]. This family is discussed in detail 
in [BEJ99]. The symmetry tp^-^ (x) = (p^ (3 - x) (see [BEJ99, Proposition 4.1]) 
implies that we may limit the discussion to the interval — ^ < 8 < -|. In this 
interval the conditions (2.17)-(2.22) are fulfilled with the one exception of 8 = -|, 
where 

4 f) w = ^(i+- 3 ) 



and the scaling function ip is given by 

<^(f ) (x) = { 



for < x < 3, 
otherwise. 



It has been observed by several authors that at this point the cascade approxi- 
mants converge merely weakly; see [BEJ99, Figure 6], [CoRy95, Figure 3.3], [Str96, 
Note 4], [Coh92, Figure 4]. When using the cascade algorithm to depict ip^ for 8 
near ^, this weak convergence seems to persist; see Figures 5-6. So let us compute 
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the Ruelle operator in (2.25) R as a 7 x 7 matrix on P[— 3,3]. The result is the 
following slant- Toeplitz matrix: 



6 

c b 

c 1 c 6 

R= & c 1 c b 

6 c 1 c 

6 0c 

6 



where 6 = 0300 and c = aiao + a 2 ai + 0302, and we have used (2.2). Viewing 
a = (a , a 2 , a 3 ) as a function on Z 4 , and letting T denote cyclic translation on 

Z 4 , 

T (a , ai, a 2 , a 3 ) = (ai, o 2 , a 3 , o ) , 
the relations (2.2) take the form 

(a,a) = l, (a,T 2 a)=0. 

Furthermore 

(a,Ta) = (a,T 3 a) = b + c, 



and by (2.3), 



^T fe a- (j2^,Y, a ^Y, a »Y, a i) = V^(l, 1,1,1). 



k=0 



Thus 



a,J2 Tka ) = ( (ao, 01,02,03), (1,1, 1,1)) = V2^a 4 = 2. 



\ fe=o / 
On the other side, 

/ 3 \ 3 



a, Tka ) = Yl ( a ' Tk °) = 1 + + c) + + (6 + c) = 1 + 2 (6 + c) . 



fc=0 



so 



6 + c= 1/2. 

Since 6(0) = g [1 + 2sin0 — cos (28)], b(0) ranges over [—jq, 5] as ranges over 
[—§)§]■ The characteristic equation of R has the obvious roots 1 and 6, where 6 
has multiplicity 2. Using the flip symmetry of the matrix under the joint flip around 
the fourth column and fourth row, the remaining fourth-order polynomial factors 
into two second-order polynomials, and thus one deduces that the eigenvalues of R 
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1 




1 

2 


^^^1 + sin 9 
2 




, f 


TV ' 

2 

-1 


7T 
2 

sin 9 

\^^^---___2 
\ N -26(6>) 



Figure 1. Eigenvalues A (8) of R 

are 



Eigenvalue A (9) 


Range of A (0) 




1 


{1} 


6 (multiplicity two) 


[ — 16' 2] 




1 

2 


{1} 




-26 


["Li] 


1 + VI + 166 

4 


f (-f <*<-?) 

I ±±f^ (-!<<?< f) 




1 - VI + 166 
4 


' i±|M (-1 <0<-f) 

[ (-!<*<§) 


[ — 2' 4] 



We see that unless 6=5, i.e., = 1 is the unique peripheral eigenvalue, and 
it has multiplicity 1. By Theorem 2.5, the cascade approximants then do indeed 
converge to the scaling function. If 9 = ^, then 1 is an eigenvalue of multiplicity 
2 and —1 is an eigenvalue of multiplicity 1, and by Theorem 2.5 the cascades do 
not converge in L 2 -norm. The slow convergence near # = § can be explained 

by the fact that R then has eigenvalues differing from ±1 by O ((6 — §) ), so 
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the approximation after n steps is like I 1 — const. (6 — ^ ) j . This slow rate of 

convergence near -| is also clear from Figure 6(a)-(l). We should emphasize that 
the semiregular layers displayed by these pictures dissolve more and more when 
doing further iterations and a plot after 1000 iterations with a resolution of 2~ 10 
shows virtually no discernible small-scale regularity. This is shown in Figures 7-9, 
and let us explain how these were produced. First note that by (2.9), the value 
of (Mip) (x) at a point x e 2 _JV Z only depends on the values of tp (y) at points 
y e 2~( JV ~ 1 )Z, and hence the exact values ip^ for x e 2 _JV Z can be determined 
exactly by an iterated matrix scheme. Also the jumps of ip^ at the points in 
2~ N Z can be determined exactly by the following cascade scheme with fixed N 
(e.g., N = 10): We define 



n- 2 



-N 



0,1,. ..,3-2 



N 



and define 

V4 m) («) 

where 



lim V (m) (x) , 



Vi m) (n) 



lim V (m) (x) , 



V> (0) (x) 



1 < X < 1, 

1< x < 3, 



and 

and 
(3.3) 



V> (m) (x) = (M"V (0) ) (&) = M^ (m_1) (x) 



Mip (x) = V2 a k ip (2x - fc) . 



k=0 



We start the recursion with 



(0) 1 forn = 0,l,...,2--l, 

+ { ' 10 for n = 2*2" + !,. ..,3- 2' 



and 



and from (3.3), 
(3.4) 



^ 0) (n) = { 



(n) - V2ELo «^r _1) (2n - A ■ 2 



forn = 0, 

1 for n= 1,2,..., 2 N , 

forn = 2 Ar + l,...,3-2 A ', 



(m-l) 



^ („) = V2ELo afcV^" 1 ' (2n - fc • 2") 



where we use the convention that 



v4 m ~ 1) (o = o 



if I < or I > 3 • 2 



AT 



In Figures 7-9, we have used this algorithm to plot ip^ 000 ^ (n), t/j^ 1000 ) (n), 
^(iooo) _ ^,(1000) ^ ^ for ^ = 10; 6/ = 97r /20. If 1000 is replaced by 100 the large- 
scale plots of ^i 100 ^ and ip^ 00>> look similar, but ip+ 00 ^ — f//j_ 100 ' ) is much larger 
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than 



^(1000) _ ^(1000) 



< 5 • 10 6 . Thus the plot shows an amazing amount of 
local continuity even though the larger-scale behaviour is quite irregular. However, 
the scaling function at 9 = 97r/20 is indeed discontinuous by the discussion below. 
This apparent non-compatibility is explained in detail in the Appendix. 

Remark 3.1. In [BEJ99], we noted that there are two neighborhoods on the circle, 
one near 9\ = and a symmetric one near # 2 = — f , such that each of the scaling 
functions tpW has x 1— > tp^ (x) continuous when 9 is in the union of the two 
neighborhoods. On the other hand if < 9 < |, then \/2a^ > 1 so at the right- 
hand endpoint x of each of the dyadic partitions, we will have (M n ip(°>) (x) — ► 

v ' n— too 

00. It then follows from [Dau92, Proposition 6.5.2 and footnote 9] or [DaLa92] that 
x 1 ► tpW (x) cannot be continuous for 9 in the first quarter circle. By the symmetry 
( x ) _ ^(B) (3 _ j;^ it i s then also not continuous in the second quarter of the 
0-circle. 

The question of continuity of ip^ has been considered in even more detail for 
our examples in the papers [CoHe92], [CoHe94, Section 4.8.1], [Wan95], [Wan96]. 
They use the real coefficients 

(3.5) Cn = V2a„, 

which then satisfy 

(3.6) 



^ CkCk+2l = 2(5;, 
k 

ck = 2 ' 



or, equivalently, 

(3.7) c + c 2 = 1 = ci +c 3 , 

so the scaling operator M in (2.9) becomes 

3 

(3.8) (M^)(x) = yc k ^(2x-k), 



/ 1 

k=0 



where 



(3.9) 



see an illustration in Figure 2. In [Wan96, Proposition 2.1], it is stated that if there 
is a continuous scaling function, then |co| < 1 and | C3 j < 1, that is, tt < 9 < 2tt. 
Since our movie reel shows that the scaling function clearly is discontinuous for 
< < \ and then by symmetry for < 9 < it, this is consistent with the 
movie reel. (It is still an open question, for < 9 < n, as to "how discontinuous" 
x 1 ► tpW (x) then is. Based on graphics, and analogies (see Section 4) to iterated 
function systems, it is likely that the discontinuous cases have interesting fractal 



co = 


i (1 — cos t 


1 + sin 9) , 


Cl = 


^ (1 — cost 


>-sin0), 


C2 = 


i (1 + cos£ 


>-sin0), 


C3 = 


i (1 + cos£ 


) + sin 9) ; 
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Ck(9) C 3 C C\ C 2 



1 











0\ / \ / 11 \ / \ / 1~k 



9 



ci c 2 c 3 c 



Figure 2. c fe (9) = V2~4 e) , fc = 0,1,2,3. See (3.9). 



structure, but that will be postponed to a later paper.) On the other hand, the 
comment to Figure 4.3 in [CoHe94, page 193] indicates that the condition 9 e 
(tt, 37r/2) U (3tt/2, 2tt) is necessary and sufficient for a continuous scaling function. 
This is indeed consistent with our movie reel, but the reel shows extremely singular 
behaviour of the scaling function at some dyadic rationals even in the domain of 
continuity. The effect is most pronounced for Figures 3(b) and 3(j). We refer to 
the papers above and [DaLa92] for the actual methods used to establish continuity. 

The flip symmetry of the example above holds more generally whenever the 
Ruelle matrix is defined from a low-pass filter function mo (z) = &o+&iz+- • -+a,NZ N 
with real coefficients a^. Specifically let R denote the associated Ruelle operator. 
We then have 

Lemma 3.2. 

(RO (z- 1 ) = (Re)(z), 



where C (z) ^^z^^eC [z, z" 1 ] , z e T. 
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Proof. From the definition of R we have 



2 



= ^EI-o(-- 1 )| 2 e(-- 1 ) 

It? 2 — Z 

2 I 2 



2 



where we used the reality assumption in the form m (w x ) = m (w). □ 

We now turn to the graphics which illustrate the cascade approximation (2.9)- 
(2.10). It follows from (2.1) and [Dau92, pp. 205-6] that this can be based on the 



matrix 











°\ 




ai 


a 








«3 


a 2 


ai 


\o 








03/ 



, or alternatively on just the framed 2x2 matrix 



a3 a2 



The so-called cascade algorithm [Dau92, p. 205] starts with initial points on an 
integral grid, and the n'th step fills in points on 2 _n Z places according to matrix 
multiplication and use of the weights from the respective matrix entries. We include 
more technical points in the captions of the pictures, and in the Appendix. 

Remark 3.3. While the results above concern primarily orthogonality and L 2 - 
cascade approximation, there is a direct connection between the L 2 -theory and 
pointwise features of the approximation, as is pointed out in [Dau92, p. 204]. Propo- 
sition 6.5.2 in [Dau92] makes that explicit when the a priori assumption is made 
that ip^ (x) is continuous in x. As noted in Remark 3.1 and Section 4, and in 
[BEJ99], such continuity is only known when 9 is restricted to certain subinter- 
vals of (— 7r, 7r], and the pictures serve to illustrate the features when 9 is in the 
complement of the "good" regions. 

We stress that the fast algorithm used for some of the graphics (see the Appendix) 
does in fact depend on the orthogonality of the family {(p^ (x — n) \ n e Z} in 
I? (K) , and this orthogonality we verified in [BE J99] to be satisfied for all values 
of 9 except for 9 = j . The significance of this orthogonality is also directly related 
to the assumption made in Theorem 2.5(h), i.e., (2.24), on the starting function 
for the cascade approximation (2.12), but the fast algorithm of the Appendix 
is different from (2.9)-(2.12). Details on the comparison of the two are to be found 
in the Appendix. 

List of Figures 

1 Eigenvalues A {9) of R 16 

2 Ck (9) = V2af\ k = 0, 1, 2, 3 (see comments below) 19 

3 Scaling function "movie reel", 9 from to ^ in twenty-one frames (see 
comments below) 22-25 

4 The to — > oo limit of ^ (m ^ (x) for 6 = -|f 26 

5 Initial function ip(°\x) = X[o.i](a;) for the cascade series (Haar scaling 
function) 27 
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6 Cascade stages of scaling function, 6 = |^ (see comments below) 28-30 

7 ^ 1000) (n)atn = 0,l,...,3-2 10 , x n = n ■ 2' 10 31 

8 ^ 1000) (n) at n = 0,l,...,3-2 10 , x n = n ■ 2' 10 31 

9 (V^ 1000) (n)-^ 1000) (n)) atn = 0,l,...,3-2 10 , x„=n-2- 10 32 

10 V'± o) (n) at n = 0, 1, . . . , 3 • 2 10 , x n =n- 2~ 10 32 
In Figure 6(a)-(l), we study the cascade approximation at 9 = |^ with the 



Haar function as starting point. Special attention will be given to the asymptotic 
properties of the jumps at certain dyadic rationals. The rightmost term in the sum 
expansion for M n ^ Q \x) = E; ^(n)^ (0) (2™ x — i) is 

h Nn (n)^(2 n x-N n ), 

where N n = 3(2Hn - H 1-2 + 1) = 3-(2iZn-l), and where V2a 3 > 1, h Nn {n) = 

2f oig — ► oo, and each of the previous subpartition highpoints hN n -kz{n) contains 

n—>oo 

a term with a factor 2 2 z L a 3 l ~ k — ► oo, k = 0,1,2,.... For example, note that 

n— >oo 

^(n) = M n^(0) is SU pp 0rte d i n [0,a;„), where x n = 3 - 2 1 "™, and 

lim (») = (V2a 3 ) n — > oo. 

Figure 6(a): = M^{x) = v^Ei a »^ (0) ( 2a; ~ cf - ( 2 - 9 )- The a *' s 

are given by (3.2). 
Figure 6(b): ^ 2 \x) = M 2 ^\x) = 2E fc E* a 4 afc- 2l ^ (0) (4^ - k) = 

E fc ^(2)V (0) (4x-fc). 
Figure 6(c): ^(x) = M 3 V> (0) (*) = 2§ E fe E« <W fc _4i- 2j V> (0) (8a: - k) = 

E fe ^(3)V- (0) (8^-fc). 
Figure 6(d): V (4) (z) = M 4 i|i<°)(i) = E fe M 4 )^ (0) (2 4 x - k), where the height 

of the rightmost column is h^A) = (\/2a 3 ) 4 , and further high points at 

/i42(4), /i3g(4), /i36(4), . . . , corresponding to rightmost "bumps" in the 

subpartition intervals. 

Figure 6(1): The n = 12 case of ipW(x) = M n ip(°\x) = 

J2 k h k (n)^(2 n x-k), where 2- n E fc M«) 2 = 1 1 ^T" V (0) 1 1 1 1 1 V' C0) 1 1 1 - 1 by 
(2.81), despite the divergence at the high rightmost "bumps", n — > oo. 

Additional comments on the Figures: The numerical sizes of the y-coordinates of 
the cusp points in Figure 3 are computed and displayed in the Appendix. Note 
that figures 3(b)— 3(j) represent continuous functions, while all other figures repre- 
sent discontinuous functions [CoHe94, p. 193]. See more details of Figure 3(b) in 
Figure 4. 

See http : //cm . bell-labs . com/who/wim/cascade/ for more pictures, but with- 
out emphasis of the singularities at dyadic points. 

In Figure 2, note that everywhere except at the four points 8 = k = 0, 1, 2, 3, 
we have precisely three positive a,'s and one negative one. The significance of that 
is discussed in Section 4 below. 
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(4.? ■ . 



i 

o: 




Figure 3. Scaling function "movie reel", 9 from to ^ in 
twenty-one frames: Frames m-r 
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s: e 



2tt 

5 



t: 



20 



(see also Figure 6(h)) 



u: 0=f 
(averaged) 



Figure 3. Scaling function "movie reel", 9 from to f in 
twenty-one frames: Frames s-u 
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Q 1 2 3 



(detail) 



Figure 4. The m — » oo limit of (x) for = 
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Figure 5. Initial function tp(°\x) — X[o,i]( x ) f° r the cascade series 
(Haar scaling function) 
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a: Cascade stage 1 



b: Cascade stage 2 



If- 



tf 



12 

c: Cascade stage 3 



d: Cascade stage 4 
Figure 6. Cascade stages of scaling function, = Stages 1-4 
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Figure 6. Cascade stages of scaling function, = |£: Stages 5-8 
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A.Jlli»i-.-l.ir*£ij,3 



i: Cascade stage 9 



j: Cascade stage 10 





k: Cascade stage 11 1: Cascade stage 12 

Figure 6. Cascade stages of scaling function, 9 = |^ : Stages 9-12 
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- . . ii- :? .1 -- — • * 



Figure 7. The figure above shows i/'+ 000) (") at n = 0, 1, . . . , 3 • 
2 10 , x n = n ■ 2- w {6=%). 




''.os 



Figure 8. The figure above shows ^ 1000) (n) at n = 0, 1, . . . , 3 ■ 
2 10 , x n =n- 2- 10 (0 = f ). 
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10 



10 



-1 . 10 



-4 . 10 



rj^rt if^n-- - i^BT ji"Er ^ ■ " -^f— * u- ' * - 



Figure 9. The figure above shows ^^ 1000 ^ (n) — ^f 1000 ^ (n)^ at 
n = 0, 1, . . . , 3-2 10 , x„ = n-2~ 10 . Note the small scale at the y-axis, 



showing 



, (1000) , \ 



^ {Wm) (n) 



< 



See the Appendix for 



1000000 • 

a discussion of this convergence of the differences to zero. 



U .3 



Q .6 



Q.4 




;_r.-sv t? -.- -.i - ■ * 



Figure 10. The figure above shows V4 (n) at n — 0, 1, . . . , 3-2 10 , 
x n = n- 2- 10 (8 = 
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4. Conclusions 

While it may be difficult to discern an overall pattern in the computer-generated 
output from the cascades, the expectation is that there are domains of starting 
points (functions or coefficients) which lead to "nice" limit functions, L 2 (R) or con- 
tinuous, while the other extreme ones lead to fractal-like pictures. This is based on 
two analogies, in addition to the existing results in [DaLa92], [CoHe92], [CoHe94], 
[Wan95], [Wan96]: 

(i) First, we think of (2.1) as a version of an iterated function system in the 
sense of Hutchinson [Hut81]. Hutchinson considers Borel mappings fi, ■ ■ ■ , f n on a 
complete metric space X, probabilities p\ , . . . ,p n , Pi > 0, J2iPi — 1- Such a system 
defines a dynamical system xq — > x\ — > • • • in X where 

Xi+l = fai (Xi) 

and where the indices on are chosen randomly for each i, with probability p ai . 
For Borel probability measures /i on X, set Mj (/i) = fi o /r ; i.e., Mj (/i) (E) = 
/x (fr 1 (E)), where Sclisa Borel set, and f^ 1 (E) = {x e X \ fi (x) e E}. A 
fixed point for the system is a measure \x such that 

(4.1) /x = ^p i M i (ii), 

i 

and we think of (2.1) as a version of this, but of course, in (2.1) we do not nec- 
essarily have the coefficients positive, and we do not impose the same normal- 
ization. Nonetheless, we may take X = R, fi(x) := ^(x + i), and pi := -^ai, 
i = 0,1, ... ,N. (Figure 2 shows examples with four coefficients, where generi- 
cally three are positive and one is negative.) Hutchinson's theorem for the general 
version of (4.1) yields existence and uniqueness of ^ provided the probabilistic as- 
sumptions hold, and the mappings fi have contractive Lipschitz constants. Even 
in the wavelet setting, one may ask for a signed measure fx solving (4.1) and make 
the distinction between solutions [i which are absolutely continuous with respect to 
Lebesgue measure, vs. the measures fi which are singular. In the first case, there is 
a Radon-Nikodym derivative ip which may be viewed as a solution to the original 
problem (2.1). 

(ii) The second analogy is to a problem studied by P. Erdos [Erd40] for an 
iterated function system on 

A" =[0,1], 0<A<1, 
fo (x) = Ax, 

( 4 - 2 ) /i (x) = Xx + l- A, and 

1 

Pa =Pi = 2' 

The corresponding problem (4.1) and (4.2) leads to a probability measure and 
Erdos showed that n\ is either absolutely continuous, or else totally singular. Total 
singularity means that there is a subset E C X (= [0, 1]) such that [i\ (E) = 1 and 
E is of zero Lebesgue measure. This shows up in computer output as fractal-like 
appearance. It is known that if A < \, then the support of fi\ is a Cantor set 
of Hausdorff dimension — y^j, while the case A > | is not fully understood, the 
expectation being that will be more regular (or "less fractal") if A is closer to 1. 
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The case A = \ in the Erdos construction is clear, of course, and yields d/ii (x) = 

ip (x) dx where ip — is precisely the Haar function from (2.13) above, and dx 
is Lebesgue measure, i.e., ip satisfies 



(|) =<p(x) + (p(x-l), 



which is also (2.1) for the Haar scaling function. 

It should also be mentioned that Solomyak [Sol95] proved that for almost all A 
in (5,1), the measure [i\ does have an L 2 (R) density. An example of A > i when 
[i\ is known to be singular is A -1 = (l + \/5) /2, the golden ratio. 

Appendix 
by Brian F. Treadway 

By the method of Daubechies [Dau92, Section 6.5, pp. 204-206], the iteration 
(3.4) can be replaced by a "local" iteration in which the two constant values of 
V> (ro) (x) on adjacent intervals at the left of the point x = k ■ 2 (m 1 >, 

: = V> (m) (x) for x e [(2k - 2) • 2" m , (2k - 1) • 2-" 1 ) , 

^<£>i := V' (m) (x) for x e [(2k - 1) • 2" m , 2k ■ 2~ m ) , 

are determined as a linear combination of two constant values of ^>( TO_1 ) (x) on 
adjacent intervals at the left of the same x from the previous iteration, viz. 

iP { ™- 1] := V (m) (x) for x e [(jfe - 2) • 2-< m - 1 \ (k - 1) • 2-( m - 1 >) , 

Vfc 1 ' ■= ^ (m) (x) for x € [(A; - 1) • 2-< m - 1 \ jfc • 2-< m - 1 )) . 
The relation is 



with 



•\/2ci2 \/2ao 
\/2a3 \/2ai 

where the a.j are as in (3.2). With the definition 

/ ,(m-l+i)\ 

(A.2) ^- 1+i := Vl4 , 

V^(fc-2i)-l7 

the relation (A.l) becomes 

The matrix relation (A.l) is in fact a transcription of the equations (6.5.9) and 
(6.5.10) in [Dau92], and the reader should be warned that the relations are not 
merely an algebraic consequence of the cascade iteration (2.9), but depend heavily 
on the orthogonality of the translates of the successive iterates ip( m \ See Re- 
mark 3.3 for comments on this dependence on orthogonality. The relation between 
Daubechies's iterates and ours is that 



V4 m) (n) = (M™ X[ o,i)) (» • 2"") = t 



(m) 

2~ N + m 1 
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at least when to > M. Thus, when Daubechies's scheme is iterated more than N 
times, the two methods give the same values on Z2~ N . In order to get the values of 
ip at dyadic rationals it is necessary to switch from Daubechies's algorithm, which 
doubles the number of points at each iteration, to the algorithm based on (2.9), 
where the number of points is constant. The extreme cusps in the movie reel Figure 
3(b)— (j) are therefore computed by using (2.9) with large n's, or by computing the 
limit n — > oo as in (A. 4) below. The connection between the two versions of the 
cascade algorithm is spelled out in [Dau92, p. 204], where equation (6.5.3) shows 
that the value of ip at the dyadic rational n ■ 2~ N may be approximated by V^'-w 

as j — > oo where ipm = 2^ 2 {ip \ tp-j, m ) an d <Pj,m ( x ) = 2~^ 2 <p (2~^x — to). When 
the approximations at a given stage are written out in terms of products of the 
coefficients dj in the order they arise in the iteration, it is found that the two 
algorithms supply these factors in precisely the opposite order. 

If sin 6 7^ —1, the eigenvectors of A = ( ^? a2 v ^ a ° ) are: for eigenvalue Ai = 1, 

V V 2a 3 V2ai / 

ei = (}); for eigenvalue A 2 = — sin#, e 2 = (yl" 1 )- A starting vector £ m _i = 
composed of ordinatcs in the (to — l)'st cascade approximant gives, by 



A £m— 1 — £m— 1+m 



Vk-2 

n applications of A 



where £ m _i +n , given by (A. 2) above with i = n, is composed of ordinates in the 
(to — 1 + n)'th cascade approximant on two intervals at the left of the same point 
x = k ■ 2~(" l ~ 1 ). Expanding the starting vector £ TO _i in the two eigenvectors, we 
get 



£m-i = ctiei + a 2 e 2 

, (m— 1 

J k-2 ' - v 



y/2 (a 2 - oi) 

-*Z> (m_1) +ib {m ~ 1] 
Vk-l ^ Vfe-i 

H 7z~, ; e2 - 

V2(a 2 - ai) 



-ei 



Thus 



-ei 



(A. 3) £m-l+n — ^4"Cm-l 

= A™aiei + A 2 Q! 2 e 2 

V2 (a 2 - ai) 

+ (-sin0) — e 2 . 

V2 (a 2 - oi) 

This form allows us to read off the n —* oo limit directly except in the two special 
cases 9 = ±-|, as the second term on the right-hand side of (A. 3) vanishes in the 
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limit unless sin 9 = — 1 (already excluded) or sin 6 = 1 : 
(A. 4) lim £„ = lim £ m _i+„ 

V2(a 2 ai) ^02^2 ^ - V2a 1 ^"T 1 V ' 

Note that the two components of this limit vector are equal, so the "jump" 
in ip disappears at the dyadic rational point x in question. This does not imply 
that ip converges to a continuous function ip. While the values of ip( m - 1+n ) (#) on 
the two intervals in the stage- (m — 1 + n) partition at the immediate left of the x 
in question do move closer together as n increases until their difference becomes 
zero in the limit, values at higher-order dyadic rationals are not so well behaved: 
for example, we can see, by extending the treatment above to a 3 x 3 matrix 

(V2a 3 \/2oi \ 
o V2a 2 V2a ) iterating three intervals at the left of the point x instead 

of just two, that the value of ^>( m_1 + n ) (x) on the third interval added at the left 
of the original two generally grows without limit as n — > oo if the new eigenvalue 
V203, corresponding to the eigenvector (o), is greater than 1, which is the case 
for0<6»<f. 

As an example, the negative "peak" at x = 1 (sec Figure 3(b)— (j)) can be 

( ip m \ 

obtained by using the starting vector £o = ( J = ( l ) (following the convention 

that V'*-™^ i x ) f° r x < for all m, and thus ip^\ = 0; see Figure 5): then the two 
components of lim^oo are both equal to 

\/2ai 1 — cos 9 — sin 9 



v/2(a 2 -oi) 2 cos 6 



= \ I 1 + tan ■ 



Similarly, the peak at x = 2 in the same plots can be obtained by using the starting 



(0) 



vector £ = ( ^< ) J = (o) ( see Figure 5); it is 

\/2a 2 I + cos 9 — sin 9 



V2(a 2 -ai) 2cos# 
1 , 

1 — tan ■ 



2 V 2 

Note that these two peaks sum to 1 for all 9 (excluding if). The limiting value at 
x = 3/2 is also independent of 9. For this, use the starting vector £i = 

(vlaO ( SGe Fi S ure 6 ( a ))' This is -i ust e2 ' so 



_ ( ^ \ _ 



lim ^„ = 
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The numerical values of lim^oo ip( n '> (x), x — 1, |, 2 for the "movie reel" plots in 
Figure 3 are given below. 



Fig. 3 6 


X = 1 


2 

X = j 


x = 2 


(b) 


9tt 
20 


-5.8531 





6.8531 


(c) 


2tt 
5 


-2.6569 





3.6569 


(d) 


7tt 
20 


-1.5826 





2.5826 


(e) 


3tt 
10 


-1.0388 





2.0388 


(f) 


7T 

4 


-0.7071 





1.7071 


(g) 


7T 

5 


-0.4813 





1.4813 


(h) 


377 

20 


-0.3159 





1.3159 


(i) 


7T 

10 


-0.1882 





1.1882 


0) 


7T 

20 


-0.0854 





1.0854 


(k) 





0.0000 





1.0000 


(1) 


7T 

20 


0.0730 





0.9270 


(m) 


7T 

10 


0.1367 





0.8633 


(n) 


3tt 
20 


0.1936 





0.8064 


(o) 


7T 

5 


0.2452 





0.7548 


(P) 


7T 

4 


0.2929 





0.7071 


(q) 


3jt 
10 


0.3375 





0.6625 


(r) 


7tt 
20 


0.3800 





0.6200 


(s) 


2?r 
5 


0.4208 





0.5792 


(t) 


9tt 
20 


0.4606 





0.5394 



These values can be observed in the plots, though for 6 near | the cascade level 
plotted (to = 8) is not high enough to show a close approach to the limit. 
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